Near Real-time Data Warehousing with Multi-stage Trickle & Flip
Abstract
A data warehouse is typically a collection of historical data designed for decision support, so it is refreshed from its sources periodically, most often daily. Today's businesses, however, ask for fresher data. Real-time warehousing is one trend that addresses this need, but several challenges stand in the way of true real-time operation. This paper proposes the 'Multi-stage Trickle & Flip' methodology for data warehouse refreshment. It builds on the 'Trickle & Flip' principle and extends it to further insulate loading and querying activities from each other, enabling both to be more efficient.
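The abstract does not spell out the mechanics, so the following is only a minimal sketch of the general trickle-and-flip idea as it is commonly described: new rows trickle into a staging copy that queries never read, and a periodic flip publishes a snapshot of that copy in a single swap. The class name, method names, and row layout are all hypothetical, and the sketch models a single stage in memory. It does not attempt the paper's multi-stage extension.

```python
import threading


class TrickleFlipTable:
    """Toy in-memory model of trickle & flip (all names are hypothetical)."""

    def __init__(self):
        self._staging = []            # receives the continuous trickle of new rows
        self._live = ()               # immutable snapshot that queries read
        self._lock = threading.Lock()

    def trickle(self, row):
        """Loader side: append one row to staging; queries do not see it yet."""
        with self._lock:
            self._staging.append(row)

    def flip(self):
        """Publish a copy of staging as the new live snapshot in one swap."""
        with self._lock:
            self._live = tuple(self._staging)

    def query(self):
        """Query side: return the current snapshot without touching staging."""
        return self._live


table = TrickleFlipTable()
table.trickle({"order_id": 1, "amount": 10.0})
table.trickle({"order_id": 2, "amount": 5.5})
before_flip = len(table.query())   # rows visible to queries before the flip
table.flip()
after_flip = len(table.query())    # rows visible to queries after the flip
```

In this sketch, queries only ever read an immutable snapshot, so loading never changes a result set that a query is already holding. Data freshness is then bounded by how often `flip` is called.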
Similar resources
Near Real Time ETL
Near real time ETL deviates from the traditional conception of data warehouse refreshment, which is performed off-line in a batch mode, and adopts the strategy of propagating changes that take place in the sources towards the data warehouse to the extent that both the sources and the warehouse can sustain the incurred workload. In this article, we review the state of the art for both convention...
Tuned X-HYBRIDJOIN for Near-Real-Time Data Warehousing
Near-real-time data warehousing defines how updates from data sources are combined and transformed for storage in a data warehouse as soon as the updates occur. Since these updates are not in warehouse format, they need to be transformed and a join operator is usually required to implement this transformation. A stream-based algorithm called X-HYBRIDJOIN (Extended Hybrid Join), with a favorable...
Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing
Mesa is a highly scalable analytic data warehousing system that stores critical measurement data related to Google’s Internet advertising business. Mesa is designed to satisfy a complex and challenging set of user and systems requirements, including near real-time data ingestion and queryability, as well as high availability, reliability, fault tolerance, and scalability for large data and quer...
From data warehousing to active information integration systems
Enterprises have gathered operational business information from multiple structured data sources and stored it in a central repository, called a data warehouse, for decision support functionalities and data analysis. Enterprises are now recognizing the need to integrate all of their information sources, including "unstructured" contents, for deeper and richer information analysis. Several applications,...
Active Data Warehousing: A New Breed of Decision Support
Active data warehousing is rapidly changing the landscape for deployment of decision support solutions. The trend toward actionable business intelligence demands that capabilities for tactical and event-driven decision-making be supported in addition to traditional uses of the data warehouse for strategic decision-making. The resulting challenges to deliver extreme service levels in the areas o...
Publication date: 2011